Nonparametric Generation of Synthetic Data Using Copulas

نویسندگان

چکیده

This article presents a novel nonparametric approach to generate synthetic data using copulas, which are functions that explain the dependency structure of real data. The proposed method addresses several challenges faced by existing generation techniques, such as preservation complex multivariate structures presented in By all information from and verifying generated follows same behavior under homogeneity tests, our is significant improvement over techniques. Our easy implement interpret, making it valuable tool for solving class imbalance problems machine learning models, improving generalization capabilities deep anonymizing finance healthcare domains, among other applications.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Synthetic Data Generation using Benerator Tool

Datasets of different characteristics are needed by the research community for experimental purposes. However, real data may be difficult to obtain due to privacy concerns. Moreover, real data may not meet specific characteristics which are needed to verify new approaches under certain conditions. Given these limitations, the use of synthetic data is a viable alternative to complement the real ...

متن کامل

Generation of degree-correlated networks using copulas

Different dynamical processes on complex networks such as epidemic spreading, information propagation or cascading phenomena are highly affected by the underlying topologies as characterized by, for instance, the degree-degree association. Here, we introduce the concept of copulas in order to artificially generate random networks with a rich a priori degree-degree association structure. The acc...

متن کامل

Shape-based scenario generation using copulas

4 The purpose of this article is to show how the multivariate structure (the ”shape” of the dis5 tribution) can be separated from the marginal distributions when generating scenarios. To do 6 this we use the copula. As a result, we can define combined approaches that capture shape with 7 one method and handle margins with another. In some cases the combined approach is exact, in 8 other cases, ...

متن کامل

Using distortions of copulas to price Synthetic CDOs

This paper uses distortions of the bivariate Gaussian copula to produce a heavy tail for expected portfolio loss distribution in the context of synthetic Collateralized Debt Obligations (CDOs). We demonstrate that when the distorted copulas are used within the JP Morgan CDO pricing formula, as an example, we can simulate quite realistic tranche prices. Furthermore, we need only one dependence p...

متن کامل

Scenario Generation Employing Copulas

stochastic programs are effective for solving long-term planning problems under uncertainty. Such programs are usually based on scenario generation model about future environment developments. In the present paper, the scenario model is developed for the case when enough data paths can be generated, but due to solvability of stochastic program the scenario tree has to be constructed. The propos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12071601